A Hybrid CNN and RNN Variant Model for Music Classification
نویسندگان
چکیده
Music genre classification has a significant role in information retrieval for the organization of growing collections music. It is challenging to classify music with reliable accuracy. Many methods have utilized handcrafted features identify unique patterns but are still unable determine original characteristics. Comparatively, using deep learning models been shown be dynamic and effective. Among many neural networks, combination convolutional network (CNN) variants recurrent (RNN) not significantly considered. Additionally, addressing flaws particular model, this paper proposes hybrid architecture CNN RNN such as long short-term memory (LSTM), Bi-LSTM, gated unit (GRU), Bi-GRU. We also compared performance based on Mel-spectrogram Mel-frequency cepstral coefficient (MFCC) features. Empirically, proposed Bi-GRU achieved best accuracy at 89.30%, whereas hybridization LSTM MFCC 76.40%.
منابع مشابه
A Hybrid CNN-RNN Alignment Model for Phrase-Aware Sentence Classification
The success of sentence classification often depends on understanding both the syntactic and semantic properties of wordphrases. Recent progress on this task has been based on exploiting the grammatical structure of sentences but often this structure is difficult to parse and noisy. In this paper, we propose a structureindependent ‘Gated Representation Alignment’ (GRA) model that blends a phras...
متن کاملRelation Classification: CNN or RNN?
Convolutional neural networks (CNN) have delivered competitive performance on relation classification, without tedious feature engineering. A particular shortcoming of CNN, however, is that it is less powerful in modeling longspan relations. This paper presents a model based on recurrent neural networks (RNN) and compares the capabilities of CNN and RNN on the relation classification task. We c...
متن کاملA Hybrid Hmm-rnn Model for Optical Music Recognition
Optical music recognition (OMR) serves as one of the key technologies in Music Information Retrieval by mining symbolic knowledge directly from images of scores. A fullfledged OMR system encompasses both image recognition and music interpretation to convert image data to symbolic representations. This process has proved to be remarkably challenging, so the state-of-the-art OMR systems still lea...
متن کاملUnconstrained OCR for Urdu using Deep CNN-RNN Hybrid Networks
Building robust text recognition systems for languages with cursive scripts like Urdu has always been challenging. Intricacies of the script and the absence of ample annotated data further act as adversaries to this task. We demonstrate the effectiveness of an end-to-end trainable hybrid CNN-RNN architecture in recognizing Urdu text from printed documents, typically known as Urdu OCR. The solut...
متن کاملCNN based music emotion classification
Music emotion recognition (MER) is usually regarded as a multi-label tagging task, and each segment of music can inspire specific emotion tags. Most researchers extract acoustic features from music and explore the relations between these features and their corresponding emotion tags. Considering the inconsistency of emotions inspired by the same music segment for human beings, seeking for the k...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2023
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app13031476